Decision tree-based acoustic models for speech recognition
Related resources
Decision Tree-Based Acoustic Models for Speech Recognition with Improved Smoothness
This article proposes a new acoustic model using decision trees (DTs) as replacements for Gaussian mixture models (GMM) to compute the observation likelihoods for a given hidden Markov model state in a speech recognition system. DTs have a number of advantageous properties, such as that they do not impose restrictions on the number or types of features, and that they automatically perform featu...
Decision tree based rate of speech modeling for speech recognition
A real-world speech recognition system encounters several speaking styles and speaking rates, and its accuracy depends strongly on the speaking rate, i.e., it degrades sharply with very fast or very slow speech (including hyperarticulated speech). In this paper, we propose a generic modeling scheme to capture a range of speaking rates from very slow to very fast with the use of decision trees. This ap...
Class-triphone Acoustic Modeling Based on Decision Tree for Mandarin Continuous Speech Recognition
Decision tree based acoustic modeling has become increasingly popular for modeling speech spectral variations in continuous speech. In this paper, class-triphone acoustic models based on the decision tree are investigated for Mandarin speaker-independent continuous speech recognition. Three main questions are discussed: how to select base phone models, how to generate the question set based on l...
Decision tree acoustic models for ASR
This paper presents a summary of our research progress using decision-tree acoustic models (DTAM) for large vocabulary speech recognition. Various configurations of training DTAMs are proposed and evaluated on the Wall Street Journal (WSJ) task. A number of different acoustic and categorical features have been used for this purpose. Various ways of realizing a forest instead of a single tree have b...
Decision tree based text-to-phoneme mapping for speech recognition
In many embedded speech recognition systems, the phonetic transcriptions of the vocabulary items, i.e., the lexicons, cannot be stored on the device beforehand. A text-to-phoneme mapping functionality is hence needed to create the transcriptions from plain text. Several approaches have been evaluated in the literature. In this paper, a decision tree based text-to-phoneme mapping is studied. A d...
Journal
Journal title: EURASIP Journal on Audio, Speech, and Music Processing
Year: 2012
ISSN: 1687-4722
DOI: 10.1186/1687-4722-2012-10